METHODS FOR IDENTIFYING COMPOUNDS THAT BIND TO A TARGET

Background of the Invention 

Recent advances in methods for producing large libraries of peptides have
provided unprecedented numbers of peptides which can be screened for pharmaceutical
activity. Both chemical and biological methods for synthesis of peptide libraries have
been reported. For example, libraries of peptides (e.g., having 10^-10*- member
peptides) can be displayed on the surface of bacteriophage (known as "phage display"
libraries). Such peptide libraries can comprise all possible peptides of a given length
(e.g., every one of the twenty natural amino acid residues at each position of a hexamer),
or a subset of all possible peptides. Methods for screening large libraries of peptides, to
identify those peptides that bind to a target, have also been developed, such as
biopanning. These screening techniques allow for the isolation from a library of one, or
several, peptides that bind to a pre-selected target. By producing and screening large
peptide libraries, it has become possible to rapidly search for peptides (e.g., ligands) that
bind to a target (e.g., a receptor). Moreover, the structure of selected peptides can be
determined with relative ease by standard sequencing methodologies (e.g., sequencing of
the peptides themselves or of a nucleic acid molecule encoding the peptide).

Despite the advantages of peptide libraries (e.g., immense diversity and simple
"deconvolution" of the peptide structure by sequencing), the use of this approach to
identify peptides that bind a target for pharmaceutical purposes has a number of
drawbacks. For example, the affinity of a selected peptide(s) for the target often is
relatively low (e.g., high enough to detect binding of the peptide to the target but too low
for pharmaceutical potency). Moreover, peptides are not always suitable for therapeutic
administration due to such problems as difficulties in formulation (due to insolubility),
unfavorable pharmacokinetics and/or pharmacodynamics, and rapid degradation in vivo.

As an alternative to peptide libraries, libraries of non-peptide chemical compounds
(e.g., peptidomimetics, peptide derivatives, peptide analogues, etc.) can be synthesized.
Screening of a target with a non-peptide library may lead to the identification of a
compound(s) with higher affinity for the target than that of a peptide selected by random
peptide library screening and/or identification of a compound(s) with more desirable
pharmacological properties than a peptide. However, the diversity of compounds that
can be achieved by random chemical synthesis is considerably lower than that of random
peptide library synthesis, thereby reducing the likelihood of identifying a high affinity
target-binding compound from a randomly synthesized chemical library. An additional
disadvantage of a chemical library approach to identifying molecules that bind a target is
that determination of the structure of the compound(s) that binds the target (i.e.,
"deconvolution" of the compound structure) cannot be accomplished by a simple
sequencing methodology but rather requires more complex chemical strategies, thereby
limiting the number of identified compounds that can be efficiently analyzed.

Improved methods for identifying compounds that bind a target that retain the 
advantageous properties of both peptide library screening and chemical library screening 
while reducing or eliminating the disadvantageous properties of these techniques are 
needed. 



Summary of the Invention 

The present invention features methods for identifying compounds that bind a 
target that combine the use of peptide-based libraries with the use of chemically-based 
libraries such that the advantages of each approach are maintained while many of the 
disadvantages of using either approach alone are overcome. For example, the methods 
of the invention provide the diversity and ease of deconvolution of traditional peptide 
library screening yet also provide for the identification of compounds with high affinity 
for the target and desirable pharmacological properties. To optimize the benefits of both
peptide-based and chemically-based libraries, the methods of the invention involve
utilizing information obtained from screening a target with a first library comprising a
multiplicity of peptides in the design of a second library comprising a multiplicity of
chemical (i.e., non-peptide) compounds. The target is then rescreened with this second
library to identify compounds that bind to the target. 

The methods of the invention generally involve the following steps: 

a) forming a first library comprising a multiplicity of peptides; 

b) selecting from the first library at least one peptide that binds to the target; 

c) determining the sequence or sequences of the at least one peptide that binds to 
the target, thereby forming a peptide motif; 

d) forming a second library comprising a multiplicity of non-peptide compounds
designed based on the peptide motif;

e) selecting from the second library at least one non-peptide compound that binds 
to the target; and 

f) determining the structure or structures of the at least one non-peptide
compound that binds to the target;

thereby identifying a compound that binds to the target. 

The first library is composed of peptides whose structures can be determined by 
standard sequencing methodologies (e.g., direct sequencing of the amino acids making 
up the peptides or sequencing of nucleic acid molecules encoding the peptide). Thus,
the first library provides the extensive diversity of peptide libraries and the ease of
deconvoluting the selected peptides. In contrast, the second, non-peptide library
preferably comprises compounds that, while not peptides, are structurally related to 
peptides, such as peptide analogues, peptide derivatives and/or peptidomimetics. The 
structure of the non-peptide compounds preferably is determined by a mass 
spectrometric method, most preferably by tandem mass spectrometry. Since the second 
library is designed based on the peptide motif generated from screening the first library, 
many of the disadvantages of traditional chemical libraries (such as reduced diversity 
and more laborious deconvolution methods) are reduced or eliminated, since the second 
library is "biased" toward compounds that have affinity for the target. This bias in the 
second library for compounds having affinity for the target means that fewer compounds 
need to be screened as compared to a random chemically-synthesized library and, 
accordingly, fewer compounds need to be analyzed structurally (i.e., deconvoluted).

In a preferred embodiment, compounds identified by screening of the second
library have at least 10-fold higher affinity for the target than the peptides identified by 
screening the first library. More preferably, compounds identified by screening of the 
second library have at least 100-fold higher affinity for the target than the peptides 
identified by screening the first library. Even more preferably, compounds identified by 
screening of the second library have at least 1000-fold higher affinity for the target than 
the peptides identified by screening the first library. 
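
By way of illustration only, the fold-improvement tiers recited above can be expressed as a ratio of binding constants. The following Python sketch assumes that affinity is reported as an apparent dissociation constant in molar units, so that a lower value indicates tighter binding; the function names `fold_improvement` and `preference_tier` are hypothetical and form no part of the described methods.

```python
def fold_improvement(kd_peptide_molar: float, kd_compound_molar: float) -> float:
    """Return how many times tighter the compound binds than the peptide.

    Assumes both arguments are apparent dissociation constants in mol/L,
    so a smaller value means higher affinity.
    """
    if kd_peptide_molar <= 0 or kd_compound_molar <= 0:
        raise ValueError("dissociation constants must be positive")
    return kd_peptide_molar / kd_compound_molar


def preference_tier(fold: float) -> str:
    """Map a fold improvement onto the 10-, 100- and 1000-fold tiers."""
    for threshold in (1000, 100, 10):
        if fold >= threshold:
            return f"at least {threshold}-fold"
    return "below 10-fold"
```

For instance, under these assumptions a peptide with an apparent dissociation constant of 2 x 10^-6 M and a second-library compound with 1 x 10^-9 M would give a 2000-fold improvement, which falls in the highest tier.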

The methods of the invention can further involve additional library screening 
steps. For example, after compounds from the second library that bind the target have 
been identified, a third library can be formed that comprises a multiplicity of non- 
peptide compounds designed based on the structure or structures of the non-peptide 
compounds identified from the second library. The target can be rescreened with the 
third library to identify additional compounds that bind to the target.

Another aspect of the invention pertains to a library comprising a multiplicity of 
non-peptide compounds designed based on a peptide motif, wherein the peptide motif is 
determined by selecting from a peptide library at least one peptide that binds to a target, 
determining the sequence or sequences of the at least one peptide that binds to the target 
and determining a peptide motif. 

Yet another aspect of the invention pertains to compounds identified by a method 
of the invention. In a preferred embodiment, the compound is a peptidomimetic. In 
other preferred embodiments, the compound that binds to a target has a binding affinity 
for the target, expressed as an apparent Kd, EC50 or IC50, of at least about 10^-7 M, more
preferably at least about 10^-8 M, and even more preferably at least about 10^-9 M.



Brief Description of the Drawing

Figure 1 is a graph depicting the ability of compounds PPI-432, PPI-652 and
PPI-654 to inhibit the binding of radiolabeled FGF to an anti-FGF antibody.



Detailed Description of the Invention

The present invention pertains to methods for identifying a compound that binds
to a target, as well as compounds identified thereby, and libraries for use in the methods of
the invention. The methods of the invention involve screening a target with at least two
distinct libraries. The term "target", as used herein, is intended to include molecules or
molecular complexes with which compounds (e.g., peptides or non-peptide compounds)
can bind or interact. Exemplary targets include ligands, receptors, hormones, cytokines,
antibodies, antigens, enzymes, and the like. The target can be, for example, a purified
compound or a partially purified compound or it can be associated with the surface of a
cell that expresses the target.

In the methods of the invention, a target is initially screened with a peptide
library to generate a peptide motif for peptides that can bind to the target. Accordingly,
the methods of the invention first involve: 

forming a first library comprising a multiplicity of peptides; 

selecting from the first library at least one peptide that binds to the target; and

determining the sequence or sequences of the at least one peptide that binds to 
the target, thereby generating a peptide motif. 

The term "peptides", as used herein with regard to libraries, is intended to include 
molecules comprised only of natural amino acid residues (i.e., alanine, arginine, aspartic
acid, asparagine, cysteine, glutamic acid, glutamine, glycine, histidine, isoleucine,
leucine, lysine, methionine, phenylalanine, proline, serine, threonine, tryptophan,
tyrosine and valine) linked by peptide bonds, or other residues whose structures can be
determined by standard sequencing methodologies (e.g., direct sequencing of the amino
acids making up the peptides or sequencing of nucleic acid molecules encoding the
peptide). The term "peptide" is not intended to include molecules structurally related to
peptides, such as peptide derivatives, peptide analogues or peptidomimetics, whose
structures cannot be determined by standard sequencing methodologies but rather must
be determined by more complex chemical strategies, such as mass spectrometric 
methods. 

The term "multiplicity", as used herein, refers to a plurality of different 
molecules (e.g., peptides or non-peptide compounds). Thus a "library comprising a 
multiplicity of peptides" refers to a library of peptides comprising at least two different
peptide members. In preferred embodiments, libraries of peptides useful in the present
invention include at least about 10^3 different peptides, more preferably at least about 10*
different peptides and even more preferably at least about lO^ different peptides.
Depending on the length of the peptide members and the efficiency of synthesis, library
diversity as high as about 10'^ different peptides or even about 10'5 different peptides
may be achievable. A library comprising a multiplicity of peptides for use in the 
methods of the invention can comprise all possible peptides of a specified length (i.e., a 
"complete" random library wherein each position of the peptide can be any one of the 
twenty natural amino acid residues, e.g., all possible hexapeptides). Alternatively, a 
peptide library can include only a subset of all possible peptides of a specified length by 
having non-degenerate positions within the peptide library (i.e., one or more positions
in the peptide which are occupied by only one, or a few, different amino acid
residue(s) within each peptide member of the library). Moreover, as the peptide length
increases, it may not be possible to achieve every possible peptide permutation within
the library. Preferably, at least about 10^5 to 10« permutations of all possible
permutations of a randomized peptide are present within the library. The length of the
peptides used in the library can vary depending upon, for example, the degree of
diversity desired and the particular target to be screened. For example, in different
embodiments, the peptide library is made up of peptides not longer than about 30 amino
acids long, not longer than about 20 amino acids long or not longer than about 12 amino
acids long. Preferably, the peptide library is comprised of peptides at least 3 amino acids
long, and more preferably at least 6 amino acids long.
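
For illustration, the size of a "complete" random library and of a library having non-degenerate positions follows directly from the number of residues permitted at each position. The Python sketch below is a minimal calculation under the assumption that each position is filled independently; the function names are hypothetical.

```python
from math import prod
from typing import Sequence

NATURAL_RESIDUES = 20  # the twenty natural amino acid residues


def complete_library_size(length: int) -> int:
    """Number of peptides when every position may be any natural residue."""
    return NATURAL_RESIDUES ** length


def restricted_library_size(choices_per_position: Sequence[int]) -> int:
    """Number of peptides when each position allows a given number of residues.

    A non-degenerate position is represented by 1 (or a small number).
    """
    return prod(choices_per_position)
```

Under these assumptions, a complete hexapeptide library contains 20^6 = 64,000,000 members, whereas a hexapeptide library in which one position is fixed and another is limited to two residues, e.g. `restricted_library_size([20, 20, 1, 20, 2, 20])`, contains 320,000 members.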

A library comprising a multiplicity of peptides can be formed by any one of 
several methods known in the art. For example, in one embodiment, a multiplicity of 
nucleic acid molecules encoding a multiplicity of random peptides are synthesized and 
the nucleic acid molecules are introduced into a vector that allows for expression of the 
encoded peptide library. One example of such a library is an "external" library in which
the peptide library is expressed on a surface protein of a host, such as a "phage display"
library (see, e.g., Smith, G.P. (1985) Science 228:1315-1317; Parmley, S.F. and Smith,
G.P. (1988) Gene 73:305-318; and Cwirla, S. et al. (1990) Proc. Natl. Acad. Sci. USA
87:6378-6382). As used herein, a "phage display" library is intended to refer to a library
in which a multiplicity of peptides is displayed on the surface of a bacteriophage, such as
a filamentous phage, preferably by fusion to a coat protein of the phage (e.g., the pIII
protein or pVIII protein of filamentous phage). In phage-display methods, a multiplicity
of nucleic acid molecules coding for peptides is synthesized and inserted into a phage
vector to provide a recombinant vector. Suitable vectors for construction of phage
display libraries include fUSE vectors, such as fUSE1, fUSE2, fUSE3 and fUSE5
(Smith and Scott (1993) Methods Enzymol. 217:228-257). Nucleic acid molecules can
be synthesized according to methods known in the art (see, e.g., Cormack and Struhl
(1993) Science 262:244-248), including automated oligonucleotide synthesis.
Following insertion of the nucleic acid molecules into the phage vector, the vector is 
introduced into a suitable host cell and the recombinant phage are expressed on the cell 
surface after a growth period. The recombinant phage can then be used in screening 
assays with a target (described further below). 

Another example of a peptide library encoded by a multiplicity of nucleic acid 
molecules is an "internal" library, wherein the peptide members are expressed as fusions 
with an internal protein of a host (i.e., a non-surface protein) by inserting the nucleic 
acid molecules encoding the peptides into a gene encoding the internal protein. The 
internal protein may remain intracellular or may be secreted by, or recovered from, the 
host. Examples of internal proteins with which peptide library members can be fused 
include thioredoxin, staph nuclease, lac repressor (LacI), GAL4 and antibodies. An
internal library vector is preferably a plasmid vector. In one example of an internal
library, referred to as a two-hybrid system (see e.g., U.S. Patent No. 5,283,173 by Fields;
Zervos et al. (1993) Cell 72:223-232; Madura et al. (1993) J. Biol. Chem. 268:12046-
12054; Bartel et al. (1993) Biotechniques 14:920-924; and Iwabuchi et al. (1993)
Oncogene 8:1693-1696), nucleic acid molecules encoding a multiplicity of peptides are
inserted into a plasmid encoding the DNA binding domain of GAL4 (GAL4db) such that
a library of GAL4db-peptide fusion proteins are encoded by the plasmid. Yeast cells
(e.g., Saccharomyces cerevisiae YPB2 cells) are transformed simultaneously with the
plasmid encoding the library of GAL4db-peptide fusion proteins and a second plasmid
encoding a fusion protein composed of the target fused to the activation domain of
GAL4 (GAL4ad). When the GAL4ad-target interacts with a GAL4db-peptide library
member, the two domains of the GAL4 transcriptional activator protein are brought into
sufficient proximity as to cause transcription of a reporter gene or a phenotypic marker
gene whose expression is regulated by one or more GAL4 operators. 

In another example of an internal library (see e.g., U.S. Patents 5,270,181 and
5,292,646, both by McCoy), nucleic acid molecules encoding a multiplicity of peptides
are inserted into a plasmid encoding thioredoxin such that a library of thioredoxin- 
peptide fusion proteins are encoded by the plasmid. The plasmid is introduced into a 
bacterial host cell where the thioredoxin-peptide fusion proteins are expressed 
cytoplasmically. The fusion proteins can be selectively released from the host cells (e.g., 
by osmotic shock or freeze-thaw procedures) and recovered for use in screening assays 
with a target. 

In yet another example of an internal library (described further in Cull, M.G. et
al. (1992) Proc. Natl. Acad. Sci. USA 89:1865), nucleic acid molecules encoding a
multiplicity of peptides are inserted into a gene encoding LacI to create a fusion gene
encoding a fusion protein of LacI and the peptide library members. The plasmid
encoding the fusion protein library members is designed such that the fusion proteins
bind to the plasmid (i.e., a plasmid encoding the LacI fusion proteins includes lac
operator sequences to which LacI binds) such that the fusion proteins and the plasmids
encoding them can be physically linked. Following expression of the fusion proteins in
host cells, the cells are lysed to liberate the fusion protein and associated DNA, and the
library is screened with an immobilized target. Fusion proteins that bind to the target are
recovered and the associated DNA is reintroduced into cells for amplification and
sequencing, thus allowing for determination of the peptide sequence encoded by the DNA.

As an alternative to forming a peptide library by synthesizing a multiplicity of nucleic
acid molecules encoding the peptide library members, a multiplicity of peptides can be
synthesized directly by standard chemical methods known in the art. For example, a
multiplicity of peptides can be synthesized on solid
supports (see, e.g., Lam, K.S. et al. (1993) Bioorg. Med. Chem. Lett. 3:419-424). Other
exemplary chemical syntheses of peptide libraries include the pin method (see, e.g.,
Geysen, H.M. et al. (1984) Proc. Natl. Acad. Sci. USA 81:3998-4002); the tea-bag
method (see, e.g., Houghten, R.A. et al. (1985) Proc. Natl. Acad. Sci. USA 82:5131-
5135); coupling of amino acid mixtures (see, e.g., Tjoeng, F.S. et al. (1990) Int. J. Pept.
Protein Res. 35:141-146; U.S. Patent 5,010,175 to Rutter et al.); and synthesis of spatial
arrays of compounds (see, e.g., Fodor, S.P.A. et al. (1991) Science 251:767). Peptide
libraries formed by direct synthesis of the peptide library members preferably are bound
to a solid support (e.g., a bead or pin, wherein each bead or pin is linked to a single
peptide moiety) to facilitate separation of peptides that bind a target from peptides that
do not bind a target.

A particularly preferred peptide library for use in the methods of the invention is
an anchor library as described in U.S. Patent Application Serial No. 08/479,660, entitled
Anchor Libraries and Identification of Peptide Binding Sequences, and corresponding
PCT Application No. PCT/US96/09383, the entire contents of both of which are
expressly incorporated herein by reference. As used herein, the term "anchor library"
refers to a peptide library in which the peptides have non-continuous regions of random
amino acids separated by specifically designated amino acid residues. Anchor libraries
are therefore subsets of a complete library of a specified length. Anchor libraries can be
used to identify essential contacts between a ligand and a target, and have the advantage
that only a subset of all possible peptides need be synthesized and screened. In a
preferred embodiment, an anchor library is made up of peptides about 16 amino acids
long. An anchor library can be prepared by genetic means (e.g., by synthesizing a
multiplicity of nucleic acid molecules encoding a multiplicity of anchor peptides) or by
chemical means (e.g., by directly synthesizing a multiplicity of anchor peptides). 
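
By way of a non-limiting sketch, the members of a small anchor library can be enumerated from a pattern in which random positions alternate with designated residues. The pattern notation used here (the letter X marking a random position, any other letter marking a designated residue) and the function name `enumerate_anchor_library` are assumptions made for illustration and are not taken from the applications cited above.

```python
from itertools import product

NATURAL_AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # one-letter codes of the twenty natural residues


def enumerate_anchor_library(pattern: str, random_symbol: str = "X"):
    """Yield every peptide that matches the pattern.

    Positions holding random_symbol take each natural residue in turn;
    every other position keeps its designated (anchor) residue.
    """
    random_positions = [i for i, c in enumerate(pattern) if c == random_symbol]
    for residues in product(NATURAL_AMINO_ACIDS, repeat=len(random_positions)):
        peptide = list(pattern)
        for position, residue in zip(random_positions, residues):
            peptide[position] = residue
        yield "".join(peptide)
```

With the hypothetical pattern "XCX", the generator yields 400 tripeptides, each having cysteine at the central position, compared with the 8,000 members of the complete tripeptide library. Exhaustive enumeration of this kind is practical only for short patterns.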

Once the peptide library has been formed, a target of interest is screened with the
peptide library to identify one or more library members that bind to the target. Peptides 
that bind a target can be selected according to known methods, such as biopanning of an 
immobilized target with a phage display library. In one embodiment, a biotinylated 
target is immobilized on a streptavidin-coated surface either before or after contacting 
the target with a peptide library and unbound peptides are removed by washing. Peptide 
libraries bound to a solid support can be screened by, for example, contacting the 
peptides immobilized on the solid support with a labeled target and detecting the labeled 
target bound to library members or, alternatively, by releasing the peptides from the
solid support and assaying the resulting solution (see, e.g., Ohlmeyer, M.H.J. et al.
(1993) Proc. Natl. Acad. Sci. USA 90:10922-10926).

Following selection of one or more peptide library members that bind to the 
target, the amino acid sequence of the peptide is determined according to standard
methods. For example, in one embodiment, the amino acid sequence of the peptide is
determined by determining the nucleotide sequence of a nucleic acid molecule encoding
the peptide and translating the encoded peptide using the genetic code. Nucleotide
sequencing can be performed by standard methods (e.g., dideoxynucleotide sequencing
or Maxam-Gilbert sequencing, either manually or using automated nucleic acid
sequencers). Alternatively, in another embodiment, the amino acid sequence of the
selected peptide(s) is determined by direct amino acid sequencing of the peptide (e.g., by
Edman microsequencing, either manually or using automated peptide sequencers). 
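
As a minimal illustration of translating an encoded peptide using the genetic code, the following Python sketch builds the standard codon table and reads a coding-strand sequence in frame from its first nucleotide. The function name `translate` and the choice to stop at the first stop codon are assumptions of this sketch; it does not handle ambiguous bases, which would raise a KeyError.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code with codons ordered TTT, TTC, TTA, TTG, TCT, ..., GGG.
# "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): residue
    for codon, residue in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(nucleotides: str) -> str:
    """Translate a coding sequence into one-letter amino acid codes.

    Reading starts at the first nucleotide, ignores a trailing partial
    codon, and ends at the first stop codon.
    """
    dna = nucleotides.upper().replace("U", "T")  # accept RNA input as well
    peptide = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        peptide.append(residue)
    return "".join(peptide)
```

For example, the hypothetical insert ACTGCTTAT is read as the codons ACT, GCT and TAT, giving the tripeptide Thr-Ala-Tyr (TAY in one-letter code).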

Once the sequence(s) of the peptide(s) that bind the target selected from the first 
library has been determined, a peptide motif is generated based on these sequences. As 
used herein, the term "peptide motif" is intended to include an amino acid consensus
sequence that represents preferred amino acid residues within a peptide that are
sufficient or essential for binding of the peptide to the target. Typically, the simplest
way to generate a peptide motif is to compare the amino acid sequences of all peptides
selected from screening a target with the first peptide library and define a peptide motif
based on one or more amino acid residues that are conserved within at least two of the
selected peptides. If only a single peptide is selected from the initial peptide library
screening, the amino acid sequence of this peptide can constitute a peptide motif.
Alternatively, when multiple peptides are selected from the initial peptide library
screening, the amino acid sequences of each of the selected peptides are optimally
aligned and amino acid residues conserved among two or more of the selected peptides
can constitute the peptide motif. In addition to, or alternative to, direct alignment and
analysis of the primary amino acid sequence of the selected peptides, a peptide motif can
be generated by more sophisticated structural analysis of the selected peptides. For
example, molecular modelling programs can be employed to determine structural motifs
present in the selected peptide(s). Examples of such structural motifs include α-helix,
β-turns, and the like (see, e.g., A. Fersht (1985) "Enzyme Structure and Mechanism", 2nd
ed., W.H. Freeman and Co., New York). Computer modelling can also be used to
calculate properties of active peptides such as hydrophobicity, steric bulk, stacking
interactions, dipole moment, and the like. Any of the above-mentioned properties can be
included when generating a peptide motif.
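
The simplest comparison described above can be sketched as follows. This Python example assumes that the selected peptides have already been aligned and are of equal length, and it reports a residue at a position only when that residue is conserved in at least two of the selected peptides; other positions are marked with a wildcard. The function name `consensus_motif`, the wildcard symbol and the example sequences are hypothetical, and the sketch does not perform the optimal alignment or the structural analysis mentioned in the text.

```python
from collections import Counter


def consensus_motif(peptides, min_count=2, wildcard="X"):
    """Derive a simple consensus from pre-aligned, equal-length peptides.

    A position is reported when its most frequent residue occurs in at
    least min_count peptides; otherwise the wildcard is emitted.
    """
    if not peptides:
        raise ValueError("at least one selected peptide is required")
    if len(peptides) == 1:
        # A single selected peptide can itself constitute the motif.
        return peptides[0]
    if len({len(p) for p in peptides}) != 1:
        raise ValueError("this sketch expects pre-aligned peptides of equal length")
    motif = []
    for column in zip(*peptides):
        residue, count = Counter(column).most_common(1)[0]
        motif.append(residue if count >= min_count else wildcard)
    return "".join(motif)
```

For the three hypothetical selected hexapeptides HPQFAG, HPQGCW and LPQFYR, the sketch returns HPQFXX: the first four positions each have a residue shared by at least two peptides, while the last two positions show no conservation.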

In the methods of the invention, after a peptide motif has been generated for a 
target of interest based on screening of the first library, the target is rescreened with a
second, non-peptide library that is designed based on the peptide motif. The second
library can be composed of compounds that are designed to have improved properties
compared to the peptides selected from the first library, such as increased
affinity for the target (e.g., predicted by computer modelling of the target with non-
peptide compounds designed based on the peptide motif) and/or improved
pharmacological properties, such as increased solubility, decreased susceptibility to
proteolytic degradation, increased biodistribution and the like. Accordingly, the
methods of the invention further comprise the steps of: 

forming a second library comprising a multiplicity of non-peptide compounds
designed based on the peptide motif;

selecting from the second library at least one non-peptide compound that binds to 
the target; and 

determining the structure or structures of the at least one non-peptide compound
that binds to the target. 

The term "non-peptide compounds", as used herein, is intended to include 
compounds comprising at least one molecule other than a natural amino acid residue, 
wherein the structures of the compounds cannot be determined by standard sequencing 
methodologies but rather must be determined by more complex chemical strategies, such 
as mass spectrometric methods. Preferred non-peptide compounds are those that, 
although not composed entirely of natural amino acid residues, are nevertheless related 
structurally to peptides, such as peptidomimetics, peptide derivatives and peptide
analogues. As used herein, a "derivative" of a compound X (e.g., a peptide) refers to a
form of X in which one or more reactive groups on the compound have been derivatized
with a substituent group. Examples of peptide derivatives include peptides in which an
amino acid side chain, the peptide backbone, or the amino- or carboxy-terminus has been
derivatized (e.g., peptidic compounds with methylated amide linkages). As used herein,
an "analogue" of a compound X refers to a compound which retains chemical structures
of X necessary for functional activity of X yet which also contains certain chemical
structures which differ from X. An example of an analogue of a naturally-occurring
peptide is a peptide which includes one or more non-naturally-occurring amino acids.
As used herein, a "mimetic" of a compound X refers to a compound in which chemical
structures of X necessary for functional activity of X have been replaced with other
chemical structures which mimic the conformation of X. Examples of peptidomimetics
include peptidic compounds in which the peptide backbone is substituted with one or
more benzodiazepine molecules (see e.g., James, G.L. et al. (1993) Science 260:1937-
1942) and "retro-inverso" peptides (see U.S. Patent No. 4,522,752 by Sisto), described
further below.

The term mimetic, and in particular, peptidomimetic, is intended to include
isosteres. The term "isostere" as used herein is intended to include a chemical structure
that can be substituted for a second chemical structure because the steric conformation of
the first structure fits a binding site specific for the second structure. The term
specifically includes peptide backbone modifications (i.e., amide bond mimetics) well
known to those skilled in the art. Such modifications include modifications of the amide
nitrogen, the α-carbon, amide carbonyl, complete replacement of the amide bond,
extensions, deletions or backbone crosslinks. Several peptide backbone modifications
are known, including ψ[CH2S], ψ[CH2NH], ψ[CSNH2], ψ[NHCO], ψ[COCH2], and
ψ[(E) or (Z) CH=CH]. In the nomenclature used above, ψ indicates the absence of an
amide bond. The structure that replaces the amide group is specified within the brackets.
Other examples of isosteres include peptides substituted with one or more
benzodiazepine molecules (see e.g., James, G.L. et al. (1993) Science 260:1937-1942),
peptoids (R.J. Simon et al. (1992) Proc. Natl. Acad. Sci. USA 89:9367-9371), and the
like.

Other possible modifications of peptides include an N-alkyl (or aryl) substitution
(ψ[CONR]), backbone crosslinking to construct lactams and other cyclic structures, or
retro-inverso amino acid incorporation (ψ[NHCO]). By "inverso" is meant replacing L-
amino acids of a sequence with D-amino acids, and by "retro-inverso" or "enantio-retro"
is meant reversing the sequence of the amino acids ("retro") and replacing the L-amino
acids with D-amino acids. For example, if the parent peptide is Thr-Ala-Tyr, the retro
modified form is Tyr-Ala-Thr, the inverso form is thr-ala-tyr, and the retro-inverso form
is tyr-ala-thr (lower case letters refer to D-amino acids). Compared to the parent peptide,
a retro-inverso peptide has a reversed backbone while retaining substantially the original
spatial conformation of the side chains, resulting in a retro-inverso isomer with a
topology that closely resembles the parent peptide. See Goodman et al. "Perspectives in
Peptide Chemistry" pp. 283-294 (1981). See also U.S. Patent No. 4,522,752 by Sisto for
further description of "retro-inverso" peptides.
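
The retro, inverso and retro-inverso forms of the Thr-Ala-Tyr example can be reproduced with simple string operations. The Python sketch below follows the notation used above, in which three-letter codes joined by hyphens denote residues and lower case denotes D-amino acids; the function names are hypothetical, and the sketch operates on notation only (it does not, for instance, take account of the fact that glycine is achiral).

```python
def retro(peptide: str) -> str:
    """Reverse the order of the residues."""
    return "-".join(reversed(peptide.split("-")))


def inverso(peptide: str) -> str:
    """Replace each L-amino acid with its D-form (written in lower case)."""
    return peptide.lower()


def retro_inverso(peptide: str) -> str:
    """Reverse the sequence and replace L-amino acids with D-amino acids."""
    return inverso(retro(peptide))
```

Applied to the parent peptide "Thr-Ala-Tyr", these functions return "Tyr-Ala-Thr", "thr-ala-tyr" and "tyr-ala-thr", respectively, matching the forms given in the text.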

Approaches to designing peptide analogues, derivatives and mimetics are known 
in the art. For example, see Farmer, P.S. in Drug Design (E.J. Ariens, ed.) Academic
Press, New York, 1980, vol. 10, pp. 119-143; Ball, J.B. and Alewood, P.F. (1990) J.
Mol. Recognition 3:55; Morgan, B.A. and Gainor, J.A. (1989) Ann. Rep. Med. Chem.
24:243; and Freidinger, R.M. (1989) Trends Pharmacol. Sci. 10:270.

The second, non-peptide library can be formed by methods known in the art for 
combinatorial synthesis of organic compounds. For example, a second library
comprising compounds that include modified amino acids (for example, D-amino acids
or synthetic amino acids such as phenylglycine) can be synthesized by techniques used
for the synthesis of peptide libraries (e.g., solid support methods described supra).
Other organic molecules that have been synthesized on solid supports include
benzodiazepines (B.A. Bunin and J.A. Ellman (1992) J. Am. Chem. Soc. 114:10997-
10998), peptoids (R.N. Zuckermann et al. (1992) J. Am. Chem. Soc. 114:10646-10647),
peptidyl phosphonates (D.A. Campbell and J.C. Bermak (1994) J. Org. Chem. 59:658-
660), vinylogous polypeptides (M. Hagihara et al. (1992) J. Am. Chem. Soc. 114:6568-
6570), and the like. An alternative synthetic scheme for chemical libraries involves
synthesis of compounds on resin beads wherein a coding moiety corresponding to each
addition in the synthesis is also coupled to the bead (see e.g., Brenner, S. and Lerner,
R.A. (1992) Proc. Natl. Acad. Sci. USA 89:5181-5183; Ohlmeyer, M.H.J. et al. (1993)
Proc. Natl. Acad. Sci. USA 90:10922-10926; Still et al., PCT publication WO
94/08051). In a preferred embodiment, the second library comprises compounds which
include at least one peptide bond (i.e., amide bond). In a preferred embodiment, the
second library is a library of peptidomimetics. 

Preferably, the second library comprises at least about 10² different compounds,
more preferably at least 10⁴ different compounds, and still more preferably at least 10⁶
different compounds. Depending upon the size of the non-peptide compounds in the
library and the efficiency of synthesis, it may be possible to achieve a second library
comprising as many as 10⁸ different compounds or even 10¹⁰ different compounds.

After formation of the second library, the target of interest is screened with the
second library, e.g., by the screening methods described above for screening the first
library. One or more non-peptide compounds that bind to the target are thereby selected.
Preferably, a non-peptide compound selected from the second library that binds to a
target has a binding affinity for the target, expressed as an apparent Kd (dissociation
constant), EC50 (concentration needed for 50% effective binding) or IC50 (concentration
needed for 50% inhibition of binding of another compound that binds to the target), of at
least about 10⁻⁷ M, more preferably at least about 10⁻⁸ M, and even more preferably at
least about 10⁻⁹ M. In a preferred embodiment, compounds identified by screening of
the second library have at least 10-fold higher affinity for the target than the peptides
identified by screening the first library. More preferably, compounds identified by
screening of the second library have at least 100-fold higher affinity for the target than
the peptides identified by screening the first library. Even more preferably, compounds
identified by screening of the second library have at least 1000-fold higher affinity for
the target than the peptides identified by screening the first library.
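
The affinity criteria above amount to simple comparisons, which can be sketched as follows. The sketch assumes that "at least about 10⁻⁷ M" means a dissociation constant (or EC50/IC50) at or below that value, i.e. binding at least that tight; the function names and the example values are hypothetical.

```python
# Thresholds taken from the text (apparent Kd, EC50 or IC50, in mol/L),
# listed from tightest to loosest binding.
TIERS = [
    (1e-9, "even more preferred"),
    (1e-8, "more preferred"),
    (1e-7, "preferred"),
]

def affinity_tier(constant_molar):
    """Return the tightest tier whose threshold the compound meets, else None."""
    for threshold, label in TIERS:
        if constant_molar <= threshold:
            return label
    return None

def fold_improvement(peptide_constant_molar, compound_constant_molar):
    """How many times tighter the compound binds than the starting peptide."""
    return peptide_constant_molar / compound_constant_molar

# Hypothetical values: a 1 micromolar peptide from the first library and a
# 5 nanomolar compound from the second library.
print(affinity_tier(5e-9))
print(round(fold_improvement(1e-6, 5e-9)))
```

With these illustrative numbers the compound would fall in the 10⁻⁸ M tier and would bind roughly 200-fold more tightly than the peptide, which would satisfy the 100-fold criterion but not the 1000-fold criterion.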

Following selection of one or more compounds from the second library that bind
to the target, the structure of the selected compound(s) is determined. In a preferred
embodiment, the structure of the non-peptide compound(s) is determined by the use of a
mass spectrometric method. Mass spectrometric methods allow for the rapid,
inexpensive, and highly accurate identification of the structure of a compound based on
the mass of the compound and on fragments of the compound generated in the mass
spectrometer. A preferred mass spectrometric technique is tandem mass spectrometry,
sometimes denoted "MS/MS". In tandem mass spectrometry, a sample compound is
first ionized and the molecular ion determined. The molecular ion is then cleaved into
several smaller fragments, which are then mass-analyzed. The use of mass spectrometry
to identify the structure of high-molecular weight compounds, including peptides, has
been reported (see, e.g., R.S. Youngquist et al. (1995) J. Am. Chem. Soc. 117:3900-3906;
B.J. Egner et al. (1995) J. Org. Chem. 60:2652-2653). It is believed that tandem mass
spectrometry is especially useful for the analysis of non-peptide compounds that contain
one or more peptide bonds (e.g., peptide derivatives, peptide analogues and/or
peptidomimetics) because the peptide bond can be cleaved in the spectrometer to
produce fragments that can be analyzed to identify particular subunits of the compound.
In certain alternative embodiments, it may be possible to analyze at least a portion of a
non-peptide compound by direct amino acid sequencing, e.g., by Edman degradation
(e.g., where the non-peptide compound comprises a peptide portion). Alternatively, in
embodiments in which the second library is synthesized in an array (e.g., on pins or in an
array on a solid surface, e.g., a "chip"), the structure of the compound can be determined
by the position the compound occupies in the array. In yet other embodiments, in which
the second library is an encoded library (i.e., a library in which the structure of the
chemical compound has been encoded on a bead, as described in Brenner, S. and Lerner,
R.A. (1992) Proc. Natl. Acad. Sci. USA 89:5181-5183; Ohlmeyer, M.H.L. et al. (1993)
Proc. Natl. Acad. Sci. USA 90:10922-10926; and Still et al., PCT publication WO
94/08051), the structure of the compound can be determined by decoding the encoding
moiety.

In a particularly preferred embodiment of the method of the invention, the first
(peptide) library is a phage display library, and the non-peptide compound(s) of the
second library that bind to the target are analyzed by tandem mass spectrometry. In
another particularly preferred embodiment of the methods of the invention, the first 
(peptide) library is an anchor library, and the compound(s) of the second library that 
bind to the target are analyzed by tandem mass spectrometry. 

The skilled artisan will appreciate that the compound or compounds identified 
from the second library can be used as a basis for forming further libraries that can be
used for further screening of the target. That is, the information gained from the
screening of the second library can be used to design another motif, for example a
modified peptide motif (e.g., a motif based on the structure of peptide derivatives,
peptide analogues and/or peptidomimetics), and a subsequent, third library can be
formed comprising compounds designed based on the motif generated from the
screening of the second library. The target is then screened with the third library and
active compounds identified as previously described herein. This process can be
repeated until a compound with a desired binding affinity for the target is obtained.

Another aspect of the invention pertains to a compound identified by the method 
of the invention. In preferred embodiments, the compound is a peptidomimetic, peptide 
derivative or peptide analogue. Preferably, a compound identified by the method of the 
invention has a binding affinity for the target, expressed as an apparent Kd (dissociation
constant), EC50 (concentration needed for 50% effective binding) or IC50 (concentration
needed for 50% inhibition of binding of another compound that binds to the target), of at
least about 10⁻⁷ M, more preferably at least about 10⁻⁸ M, and even more preferably at
least about 10⁻⁹ M. The binding affinity of a compound for a particular target can be
determined by standard methods for determining Kd, EC50 or IC50 values.

Another aspect of the invention pertains to a library comprising a multiplicity of 
non-peptide compounds designed based on a peptide motif, wherein the peptide motif is 
determined by selecting from a peptide library at least one peptide that binds to a target,
determining the sequence or sequences of at least one peptide, preferably multiple
peptides, that binds to the target and determining a peptide motif. A library of
non-peptide compounds based on a peptide motif can be synthesized by the methods
previously described herein. In a preferred embodiment, the non-peptide compounds of
the library are peptidomimetics. Additionally or alternatively, the non-peptide
compounds can be peptide derivatives and/or peptide analogues. Preferably, the library
comprises at least about 10² compounds, more preferably at least about 10⁴ compounds
and even more preferably at least about 10⁶ compounds. In one embodiment, the
multiplicity of non-peptide compounds are attached to a solid support, such as a plurality
of resin beads.

This invention is further illustrated by the following examples which should not
be construed as limiting. The contents of all references, patents and published patent 
applications cited throughout this application are hereby incorporated by reference. 

EXAMPLE 1 

In this example, the method of the invention is used to identify one or more 
compounds that bind to a target that is expressed on the surface of a cell, the luteinizing 
hormone releasing hormone receptor (LHRH-R), a member of the G-protein coupled, 
seven transmembrane receptor superfamily. 

Construction of the First Library 

A phage anchor library comprising a multiplicity of peptides is used as the first 
library in the method. The anchor library is comprised of peptides having random amino 
acid residues distributed throughout domains of alanine (Ala) and/or glycine (Gly) 
residues. For example, the anchor library can be composed of peptides that are sixteen 
amino acid residues in length and have the amino acid sequence: 

X1(Ala/Gly)4X2(Ala/Gly)4X3(Ala/Gly)4X4
wherein X1, X2, X3 and X4 can be any amino acid residue and each can be the same or
different from the others.
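
The layout of such a sixteen-residue anchor peptide, and the number of distinct sequences it allows, can be sketched as follows. The sketch assumes that each of the twelve spacer positions is independently Ala or Gly and that each X position is drawn from the twenty natural amino acids; the function names are hypothetical and one-letter codes are used for brevity.

```python
import random

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # twenty natural amino acids, one-letter codes
SPACER = "AG"                          # Ala or Gly at each spacer position

def random_anchor_peptide(rng):
    """Build one X1(Ala/Gly)4X2(Ala/Gly)4X3(Ala/Gly)4X4 sequence at random."""
    parts = []
    for block in range(4):
        parts.append(rng.choice(AMINO_ACIDS))  # randomized X position
        if block < 3:
            # four Ala/Gly spacer residues follow X1, X2 and X3
            parts.extend(rng.choice(SPACER) for _ in range(4))
    return "".join(parts)

def theoretical_diversity():
    """Distinct sequences: 20 choices at 4 X positions, 2 at 12 spacer positions."""
    return len(AMINO_ACIDS) ** 4 * len(SPACER) ** 12

rng = random.Random(0)  # seeded only so the example is repeatable
peptide = random_anchor_peptide(rng)
print(peptide, len(peptide))
print(theoretical_diversity())
```

Under these assumptions the randomized positions alone give 20⁴ = 160,000 combinations, and counting the Ala/Gly choices as well gives 655,360,000 distinct sequences.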

To prepare the anchor library, a multiplicity of oligonucleotides encoding the 
peptides are synthesized by standard methods, such as the split synthesis method (see
e.g., Cormack and Struhl (1993) Science 262:244-248). Synthesis of oligonucleotides
for construction of anchor libraries also is described further in U.S. Patent Application
Serial No. 08/479,660, entitled Anchor Libraries and Identification of Peptide Binding
Sequences, and corresponding PCT Application No. PCT/US96/09383, the entire 
contents of both of which are expressly incorporated herein by reference. 

Following synthesis, assembled oligonucleotide inserts are cloned into the
fUSE5 phage vector (Smith and Scott (1993) Methods in Enzymology 217:228-257),
which allows for expression of the encoded peptides as fusions with the pIII phage coat
protein. The vector (30 µg) is prepared by cleaving with 200 units of endonuclease SfiI
in 500 µl of restriction buffer (Buffer #2 from New England BioLabs (NEB), Beverly,
MA) for 10 hours. The reaction is terminated with addition of 15 mM EDTA, followed
by phenol/chloroform extraction. The vector DNA is recovered by isopropanol
precipitation, resuspended in 500 µl of Tris-EDTA (TE) buffer and recovered by ethanol
precipitation. The phage vector is ligated to the assembled oligonucleotide inserts at 5
µg/ml vector and three-fold excess assembled insert in ligation buffer (NEB) with 100
units of T4 DNA ligase at 10°C for 16 hours. DNA is purified from the ligation buffer
by phenol/chloroform extractions, followed by ethanol precipitations and resuspension
in TE buffer.

DNA from the ligation reaction is transformed into electrocompetent MC1061
bacterial host cells (Wertman et al. (1986) Gene 49:253-262) using 0.5 µg of DNA per
100 µl of cells using 0.2 cm electroporator cells and a BioRad electroporator set at 25
µF, 2.5 kV and 200 ohms. Shocked cells are recovered in SOC media, grown out at
37°C for 20 minutes and inoculated into LB broth containing 20 µg/ml tetracycline.

Library phage released from the transformed bacterial host cells are isolated after
growing the bacterial cells for 16 hours. Phage are separated from cells by
centrifugation at 4°C at 4.2K rpm for 30 minutes in a Beckman J6 centrifuge, followed
by a second centrifugation of the supernatant at 4.2K rpm. Phage are
precipitated with the addition of 150 ml of 16.7% polyethylene glycol (PEG)/3.3 M
NaCl per liter of supernatant. Mixed solutions are incubated at 4°C for 16 hours.
Precipitated phage are collected by centrifugation at 4.2K rpm in a J6 centrifuge,
followed by resuspension in 40 ml of Tris-buffered saline (TBS). Resuspended phage
are precipitated again with the addition of 4.5 ml of PEG solution for 4 hours. Phage are
collected at 5K rpm in a Beckman JA20 centrifuge at 4°C. Phage are suspended in 7 ml
of TBS and brought to 1.3 mg/ml density by the addition of 1 gm of CsCl per 2.226 gm
of aqueous solution. Phage are subjected to equilibrium centrifugation in a type 80 rotor
at 45K rpm for 40 hours. Phage bands are isolated, diluted 20-fold with TBS and
pelleted at 40K rpm in a type 50 rotor. Pellets are resuspended in 0.7 ml of TBS and used
as is in screening assays, described below, at approximately 3 x 10¹³ phage/ml.

Screening of the First Library

To identify members of the phage anchor library that bind to LHRH-R, 
monolayers of cells expressing LHRH-R (such as CHO, COS or SF9 cells transfected to 
express LHRH-R) adhered to culture dishes are biopanned with the phage library. The
phage (in TBS) are incubated with the cells for 1 hour at 4°C and non-specific phage are
removed by washing the cell monolayer with PBS containing 2% milk or 1% BSA or
10% serum for a total of 7 washes over 30 minutes. The remaining phage that are
bound to the cells (by way of binding to LHRH-R on the surface of the cells) are
recovered by elution with 100 mM glycine, pH 2.2 for 10 minutes. Eluted phage are
neutralized with 1 M Tris base.

Eluted phage are amplified by infection of log phase K91 E. coli (Lyons and
Zinder (1972) Virology 49:45-60; Smith and Scott (1993) Methods in Enzymology
217:228-257). Approximately 10⁵ phage are amplified by infecting an equal volume of
K91 cells with phage at 22°C for 10 minutes. Infected cells are diluted into 1 ml of LB
broth for 30 minutes at 37°C, followed by an additional dilution with 9 ml of LB
containing 20 µg/ml tetracycline and grown overnight. Phage are then separated from
cells by centrifugation and purified by PEG precipitation and resuspended at 10¹²
phage/ml.

To further enrich for peptides that specifically bind to LHRH-R, amplified phage
can be subjected to two additional rounds of biopanning using different cell types
expressing LHRH-R in each round of panning and using the binding and amplification
conditions described above.



Generation of a Peptide Motif 

Following biopanning, individual phage are isolated and sequenced to reveal the DNA
sequence that encodes the displayed peptide in each selected phage. Sequencing is
performed by standard methods (e.g., dideoxy sequencing using Sequenase 2.0, United
States Biochemical Co., Cleveland, OH, according to the manufacturer's protocol).

After obtaining the DNA sequences encoding the selected peptides, the DNA 
sequences are optimally aligned to generate a peptide motif. The peptide motif is 
determined from the amino acid residues that are conserved in at least two of the
selected peptides. For example, if biopanning of the anchor library leads to selection of 
four peptides having the following amino acid sequences (standard three-letter 
abbreviations are used for amino acids): 



Ser-(Ala/Gly)4-Arg-(Ala/Gly)4-Leu-(Ala/Gly)4-Met (SEQ ID NO: 1)
Ser-(Ala/Gly)4-Lys-(Ala/Gly)4-Leu-(Ala/Gly)4-Gln (SEQ ID NO: 2)
Phe-(Ala/Gly)4-Arg-(Ala/Gly)4-Leu-(Ala/Gly)4-Thr (SEQ ID NO: 3)
Ser-(Ala/Gly)4-Asn-(Ala/Gly)4-Leu-(Ala/Gly)4-Ile (SEQ ID NO: 4)

a peptide motif can be generated having the amino acid sequence:

Ser-(Ala/Gly)4-Arg-(Ala/Gly)4-Leu-(Ala/Gly)4-Xaa (SEQ ID NO: 5)
(wherein Xaa can be any amino acid residue).
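
The rule used above (a residue enters the motif when it is conserved in at least two of the selected peptides, otherwise the position is Xaa) can be sketched as follows. Only the four randomized positions of SEQ ID NOs: 1-4 are compared; the function name and the `min_count` parameter are hypothetical, and ties between equally frequent residues are not handled in this minimal sketch.

```python
from collections import Counter

# Randomized positions (1, 6, 11 and 16) of the four selected peptides.
selected = [
    ["Ser", "Arg", "Leu", "Met"],  # SEQ ID NO: 1
    ["Ser", "Lys", "Leu", "Gln"],  # SEQ ID NO: 2
    ["Phe", "Arg", "Leu", "Thr"],  # SEQ ID NO: 3
    ["Ser", "Asn", "Leu", "Ile"],  # SEQ ID NO: 4
]

def consensus(peptides, min_count=2):
    """Keep the most common residue per position if it occurs at least
    min_count times; otherwise mark the position as Xaa (any residue)."""
    motif = []
    for column in zip(*peptides):
        residue, count = Counter(column).most_common(1)[0]
        motif.append(residue if count >= min_count else "Xaa")
    return motif

motif = consensus(selected)
print("-(Ala/Gly)4-".join(motif))
# Ser-(Ala/Gly)4-Arg-(Ala/Gly)4-Leu-(Ala/Gly)4-Xaa
```

Position 16 becomes Xaa because Met, Gln, Thr and Ile each occur only once, which reproduces the motif of SEQ ID NO: 5.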

Construction of a Second Library

Based on the peptide motif generated from screening the target with the first
library, a second library comprising a multiplicity of non-peptide compounds is
synthesized by standard chemical synthesis methods (see e.g., Youngquist, R.S. et al.
(1995) J. Am. Chem. Soc. 117:3900-3906; Till, J.H. et al. (1994) J. Biol. Chem.
269:7423-7428; Berman, J. et al. (1992) J. Biol. Chem. 267:1434-1437). The
non-peptide compounds of the library are designed to mimic the peptide motif. For example,
to create a non-peptide library based on the peptide motif described above, amino acid
derivatives, analogues or mimetics of Ser at position 1, Arg at position 6, Leu at position
11 and/or Xaa at position 16 can be incorporated into the library. Derivatives, analogues
and/or mimetics of the repeating Ala/Gly structure can also be incorporated into the
library.

One example of a second library synthesized based on the above-described
peptide motif is an analog library in which the serine at position 1 of the motif is
substituted with homoserine, cyanoalanine, isoglutamine or isoasparagine, the arginine
at position 6 of the motif is substituted with citrulline, isopropyllysine, homoarginine,
ornithine, homocitrulline, diaminopropionic acid, aminobenzoic acid or nitroarginine,
the leucine at position 11 of the motif is substituted with NorLeu, BuGlycine,
cyclohexylalanine, norvaline, aminobutyric acid or various N-methyl aliphatic amino
acids and the Xaa at position 16 of the motif is combinatorially derived from the twenty
natural amino acids or standard analogs thereof.

Another example of a second library synthesized based on the above-described 
peptide motif is a library constructed to probe the stereochemical specificity of
compounds that bind to the target by alternating D- and L-amino acids in the library. In
this case, the library is constructed using the following L-amino acids: Glu, Arg, Asn,
Thr, Val, Pro, Met, Tyr and His; and the following D-amino acids: Asp, Lys, Gln, Ser,
Cha, Ala, Phe and Trp. The library also contains glycine. This library can define the
role of D or L stereochemistry within the selected peptide motif.

Yet another example of a second library synthesized based on the above- 
described peptide motif is a mimetic library, wherein reduced amide mimetics are 
incorporated into the compounds of the library via the use of appropriate amino acid 
aldehyde precursors and the solid phase reductive amination procedure for assembly 
(Sasaki and Coy (1987) Peptides 8:119-120). Mimetics can be incorporated at one site
or multiple sites within the library. Appropriate positions include sites within a peptide
motif containing an aliphatic or aromatic residue, such as the leucine at position 11 of
the above-described peptide motif.

Once synthesized, the library is dissolved in 1-5% dimethylsulfoxide (DMSO) in
water and used in screening assays as described below.

Screening of the Second Library

To identify members of the second library that bind to LHRH-R, membranes of 
CHO cells that have been transfected to express LHRH-R on their surface are prepared. 
One liter quantities of CHO-LHRH-R cells (e.g., 1 0^ cells/liter) are grown and 
harvested. The cells are lysed with a nitrogen bomb (see e.g., Autuori, F. et al. (1982) J.
Cell Sci. 57:1-13). 25 ml of a washed cell suspension (in an isomolar Hanks balanced
salt solution/20 mM HEPES buffer, pH 7.4) is placed in a nitrogen bomb at 4°C with
continuous stirring by a magnetic stir bar, and the pressure is adjusted to 4-50 0 psi,
followed by continuous stirring for 20 minutes. Pressure is released into a 50 ml plastic
centrifuge tube containing a 100X cocktail of protease inhibitors (0.5 mM PMSF, 10
µg/ml benzamidine, 1 µg/ml leupeptin, final concentrations). The homogenate is
centrifuged for 1 hour at 5,000 x g. The supernatant is subjected to ultracentrifugation at
50,000 x g for one hour. The final pellet is resuspended to a concentration of about 1
mg/ml and aliquots are frozen in liquid nitrogen until used in binding assays, at which
time the aliquots are thawed.

Binding reactions are set up in 12 x 75 mm polypropylene test tubes containing a 
sample of the second library (final concentration of 10 mg/ml), binding buffer (final
concentrations: 10 mM Tris, W% bovine serum albumin, pH 7.4) and a sample of the 
cell membrane preparation (approximately 10' cell equivalents per tube) in a total 
volume of 500 µl. The binding reaction is incubated on ice for 90 minutes. The binding
reaction is terminated by fast filtration of the mixture using a 12-well cell
harvester (Millipore, Milford, MA). Filters (Whatman glass-fiber filters GF/C) are
prewashed three times with 300 µl of 10 mM HEPES, 0.01% sodium azide. 3 ml of
HEPES buffer is added to each binding reaction tube and the contents of the tube are
poured over the filter in the fast filtration binding apparatus. Two additional aliquots of
buffer are added to each tube and poured over the filter. Compounds from the second
library that bind to the LHRH-R membrane preparation are retained on the filter,
whereas compounds that do not bind to the LHRH-R membrane preparation are removed.

Identification of Compounds that Bind the Target

Compounds from the second library that bind to the LHRH-R membrane 
preparation are recovered from the filter of the fast filtration binding apparatus. The 
structures of the selected compounds are determined by tandem mass spectrometry (see 
e.g., Hunt, D.F. et al. (1985) Anal. Chem. 57:765-768; Hunt, D.F. et al. (1986) Proc.
Natl. Acad. Sci. USA 83:6233-6237; Hunt, D.F. et al. (1987) Proc. Natl. Acad. Sci. USA
84:620-623; Biemann, K. (1990) Methods in Enzymology 193:455-479; Arnott, D. et al.
(1993) Clin. Chem. 39:2005-2010; Metzger, J.W. et al. (1994) Anal. Biochem.
219:261-277; Brummel, C.L. et al. (1994) Science 264:399-402).

WO 97/22617    PCT/US96/20561

EXAMPLE 2 

In this example, the method of the invention was used to identify compounds that 
bind to a fibroblast growth factor (FGF) binding protein, namely an anti-FGF 
monoclonal antibody. Starting with biologically generated peptide libraries in
bacteriophage M13, important amino acids for target binding were defined. Based on
the information generated from the bacteriophage library, a combinatorial chemical
library with a complexity of approximately 160,000 compounds was designed and
synthesized. This combinatorial library contained biased amino acid residues at key
positions. A combination of natural amino acids (i.e., the 20 amino acids encoded by
DNA) and synthetic amino acids (those containing unnatural R-groups, or the
D-enantiomer of a natural amino acid) was utilized in the chemical library. Non-peptide
compounds that interacted with the target were recovered and were analyzed. Several of
these selected non-peptide compounds were synthesized and were shown to be 10-100
fold more potent than the starting natural peptide for target binding. These non-peptide
compounds contained unnatural amino acids at 3 or more positions. This example
demonstrates that novel, high potency non-peptide compounds containing unnatural
amino acids can be discovered using phage display library screening coupled with
combinatorial chemistry library screening.

Derivation of a Peptide Sequence Motif from a Phage Display Library

A monoclonal antibody raised against human basic FGF (bFGF) was used as a 
biopanning target using a 7-mer peptide library in a bacteriophage M13 vector. To
prepare the biopanning plates, four wells of an Immulon 4 microtiter plate (Dynatech)
were coated with streptavidin (1 µg/well) in 100 mM NaHCO3, pH 9.5, for 2 hours at
room temperature or overnight at 4°C. The streptavidin-coated wells were washed three
times with phosphate-buffered saline (PBS). To each well containing streptavidin,
biotinylated rat anti-mouse antibody kappa light chain (PharMingen) was added, at
1 µg/well, in PBS for 30 minutes to 1 hour at room temperature. The wells were then
washed again with PBS three times. Non-specific binding sites were blocked with
300 µl/well PBS, 1% dry milk for 1 hour at room temperature. The wells were then
washed with PBS, 0.1% dry milk three times, loaded with 50 µl/well PBS, 0.1% dry milk
and stored at 4°C.

Monoclonal antibody to bFGF (Sigma) was added to four tubes in 100 µl of
PBS, 0.1% dry milk at the following final concentrations: 100 nM, 25 nM, 5 nM and no
antibody (control). To each tube containing anti-bFGF antibody, approximately 1 x 10¹⁰
phage from a seven-mer peptide library were added. The tubes were left at room
temperature for 2 hours to allow for specific interaction between the phage and the
anti-bFGF antibody.

After the 2 hour room temperature incubation of the phage with the antibody, the 
PBS, 0.1% milk buffer was removed from the wells of the streptavidin/biotinylated
anti-mouse complex plates and the corresponding 100 µl of each tube containing the
antibody/phage mixture was added to each of the four wells. The solutions were
allowed to sit in the wells at room temperature for twenty minutes. To remove unbound
or non-specifically bound phage, each well was washed with cold PBS, 0.1% Tween six
times, followed by cold PBS washes (six times). Specifically-bound phage were eluted
with 100 µl glycine pH 2.2 for 10 minutes at room temperature. The glycine solution
was then removed from the wells and added to polypropylene tubes with 6 µl of 2 M
Tris base to neutralize. The phage eluant was titered and the fractional yield determined.
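
The fractional yield is simply the number of phage recovered in the eluate divided by the number of phage applied. A minimal sketch follows; the input of 1 x 10¹⁰ phage per tube is taken from the text, while the eluate titers, the function name and the comparison against the no-antibody control are hypothetical and serve only to illustrate the arithmetic.

```python
def fractional_yield(eluted_phage, input_phage):
    """Fraction of the applied phage that is recovered in the eluate."""
    if input_phage <= 0:
        raise ValueError("input titer must be positive")
    return eluted_phage / input_phage

INPUT_PHAGE = 1e10  # phage added per tube, as stated in the text

# Hypothetical eluate titers (phage recovered per well); not measured values.
eluted = {"100 nM": 4.0e6, "25 nM": 1.5e6, "5 nM": 3.0e5, "no antibody": 2.0e4}

yields = {label: fractional_yield(count, INPUT_PHAGE) for label, count in eluted.items()}
background = yields["no antibody"]

for label, value in yields.items():
    # Ratio to the no-antibody control indicates antibody-dependent recovery.
    print(f"{label:>12}: yield {value:.1e}, {value / background:.0f}x over control")
```

A yield that rises with antibody concentration and clearly exceeds the no-antibody control would suggest that recovery depends on the target rather than on non-specific binding to the plate.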

For phage amplification, phage were mixed with concentrated mid log phase
E. coli strain K91 at room temperature for 5 to 10 minutes. One milliliter of LB was
added and the cells were grown at 37°C for 30 min. Nine milliliters of LB was then
added and the cells were allowed to grow overnight at 37°C. The following morning,
the bacteria were pelleted by centrifugation at 5000 rpm for 15 minutes. The supernatant
was drained into a fresh 50 ml conical tube and 1.5 ml of PEG/NaCl was added. The
tube was chilled at 4°C for 4 hours. The phage were pelleted by centrifugation at 8000
rpm for 30 minutes. The supernatant was drained off and the phage pellet was
resuspended in 1 ml PBS. The phage amplification was titered and 1 x 10¹⁰ phage were
used for subsequent rounds of screening as described above.

A number of related phage were selected by this biopanning procedure. The 
selected peptides, as deduced from DNA sequence analysis of the phage, indicated that 
the target protein binds to peptides containing the consensus sequence: P-x-G-H-x-K-x 
(SEQ ID NO: 6). Analysis of the bFGF sequence indicated that the natural epitope for 
the antibody is P-P-G-H-F-K-D (SEQ ID NO: 7), based on the strong similarity to the 
peptides selected from the phage display library. Two peptides containing this sequence 
were synthesized: PPI-416 = G-A-F-P-P-G-H-F-K-D-P-D-R-L (SEQ ID NO: 8) and
PPI-432 = P-P-G-H-F-K-D (SEQ ID NO: 7) and tested for their ability to bind to the
target. Competitive binding experiments demonstrated that PPI-416 blocked phage
containing the sequence P-R-G-H-W-K-Q (SEQ ID NO: 9) from binding to the
antibody. Furthermore, both PPI-416 and PPI-432 blocked the binding of bFGF to the
antibody. Therefore, peptides deduced from phage biopanning bound to the target
protein and blocked the interaction of bFGF. The peptide sequence information obtained
from phage biopanning was applied to the synthesis of a biased combinatorial peptide
library.

Design and Synthesis of Combinatorial Chemical Libraries

Sequence information derived from phage binding to the target protein, together
with the known sequence of bFGF, was used in the design of a secondary biased
chemical combinatorial library. Two libraries were synthesized containing a different
number of fixed and variable amino acids:

             Sequence                          Complexity
Library I    P-P-G-H-x-x-x (SEQ ID NO: 10)     1,600
Library II   P-P-x-x-x-x-x (SEQ ID NO: 11)     160,000

In these libraries, the first 2 or 4 amino acids were fixed to match the amino acids
presumed to be involved in target interaction, and the remaining positions were variable.
Natural amino acids, L-amino acids with unnatural R-groups, and D-enantiomers of
natural amino acids were incorporated into the library. Accordingly, the members of
these libraries are referred to herein as "non-peptide compounds". The natural and
unnatural amino acids chosen at the variable positions were biased based on the
sequences obtained from phage display. For example, many different amino acids were
found in the last residue (carboxy-terminal) from the panned phage (position 7).
Therefore, 24 different natural and unnatural amino acids were used at this position.
Likewise, at the penultimate position (position 6), most phage contained a lysine residue.
Therefore, this position was biased in the library synthesis by using predominantly basic
natural and unnatural amino acids.

The following abbreviations for unnatural amino acids are used herein:

Abbreviation    Residue
Abu             2-aminobutyric acid
Nor-Val         norvaline
Hyp             hydroxyproline
Cit             citrulline
Nal             3-(2-naphthyl)alanine
Orn             ornithine
Pal             3-(3'-pyridyl)alanine
Cha             cyclohexylalanine
Tic             1,2,3,4-tetrahydroisoquinoline-3-carboxylic acid
pF-Phe          para-fluorophenylalanine

The composition of the combinatorial library is summarized below in Table 1.



Table 1

Amino       Residue   Library  Library  Library  Library  Library  Library  Library
Acid        MW        AA7      AA6      AA5      AA4      AA3      AA2      AA1

Gly          57.1     X        -        -        -        X        -        -
L-Abu        85.1     X        -        X        -        -        -        -
L-Pro        97.1     X        -        -        -        X        X        X
L-NorVal     99.1     X        -        X        -        -        -        -
L-Thr       101.1     X        -        -        X        -        -        -
L-Hyp       113.2     X        -        -        X        X        -        -
L-Asp       115.1     X        -        -        X        -        -        -
L-Lys       128.2     X        X        -        X        -        -        -
L-Met       131.2     X        -        X        -        -        -        -
L-His       137.2     X        X        X        X        X        -        -
L-Phe       147.2     X        -        X        -        -        -        -
L-Arg       156.2     X        X        -        X        X        -        -
L-Cit       157.2     X        X        -        X        X        -        -
L-Tyr       163.2     X        -        X        -        -        -        -
L-Nal       197.3     X        -        X        -        -        -        -
D-Ala        71.1     X        -        X        -        -        -        -
D-Ser        87.1     X        -        -        X        -        -        -
D-Orn       114.2     X        X        -        X        -        -        -
D-Glu       129.1     X        -        -        X        -        -        -
D-Pal       148.2     X        X        X        X        X        -        -
D-Cha       153.2     X        -        X        -        -        -        -
D-Tic       159.2     X        -        X        -        X        -        -
D-pF-Phe    165.2     X        -        X        -        -        -        -
D-Trp       186.2     X        -        X        -        -        -        -

X denotes that this amino acid was used in the library construction at the indicated
position; - denotes that it was not used.
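
The complexity of Library II follows directly from the number of amino acids used at each position in Table 1. The sketch below simply multiplies those counts; the per-position lists are transcribed from Table 1 as laid out there, and the variable names are illustrative.

```python
from math import prod

# Amino acids used at each position of Library II, transcribed from Table 1.
ALL_24 = [
    "Gly", "L-Abu", "L-Pro", "L-NorVal", "L-Thr", "L-Hyp", "L-Asp", "L-Lys",
    "L-Met", "L-His", "L-Phe", "L-Arg", "L-Cit", "L-Tyr", "L-Nal", "D-Ala",
    "D-Ser", "D-Orn", "D-Glu", "D-Pal", "D-Cha", "D-Tic", "D-pF-Phe", "D-Trp",
]
LIBRARY_II = {
    "AA7": ALL_24,
    "AA6": ["L-Lys", "L-His", "L-Arg", "L-Cit", "D-Orn", "D-Pal"],
    "AA5": ["L-Abu", "L-NorVal", "L-Met", "L-His", "L-Phe", "L-Tyr", "L-Nal",
            "D-Ala", "D-Pal", "D-Cha", "D-Tic", "D-pF-Phe", "D-Trp"],
    "AA4": ["L-Thr", "L-Hyp", "L-Asp", "L-Lys", "L-His", "L-Arg", "L-Cit",
            "D-Ser", "D-Orn", "D-Glu", "D-Pal"],
    "AA3": ["Gly", "L-Pro", "L-Hyp", "L-His", "L-Arg", "L-Cit", "D-Pal", "D-Tic"],
    "AA2": ["L-Pro"],  # fixed
    "AA1": ["L-Pro"],  # fixed
}

# One compound per combination of choices, so the counts multiply.
counts = {position: len(residues) for position, residues in LIBRARY_II.items()}
complexity = prod(counts.values())

print(counts)
print(complexity)  # 24 * 6 * 13 * 11 * 8 * 1 * 1 = 164736
```

The product, 164,736, is consistent with the complexity of approximately 160,000 stated for Library II.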






An additional consideration in the design of the library was to use amino acids of 
different molecular weight at any one position. The sequence of compounds selected
from the library ultimately is determined using fragmentation and MS analysis. It is
therefore important to choose natural and unnatural amino acids with distinct molecular 
weights to avoid ambiguity in the sequence determination. 
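
The reasoning behind this design rule can be sketched with the residue molecular weights listed in Table 1. The example below checks the smallest mass difference among the residues used and then reads a sequence from an idealized ladder of cumulative residue masses. The ladder model is a deliberate simplification (it ignores terminal groups, charge state and ion type), and the function names and the 0.3 Da tolerance are assumptions made for illustration.

```python
# Residue molecular weights as listed in Table 1.
RESIDUE_MW = {
    "Gly": 57.1, "L-Abu": 85.1, "L-Pro": 97.1, "L-NorVal": 99.1,
    "L-Thr": 101.1, "L-Hyp": 113.2, "L-Asp": 115.1, "L-Lys": 128.2,
    "L-Met": 131.2, "L-His": 137.2, "L-Phe": 147.2, "L-Arg": 156.2,
    "L-Cit": 157.2, "L-Tyr": 163.2, "L-Nal": 197.3, "D-Ala": 71.1,
    "D-Ser": 87.1, "D-Orn": 114.2, "D-Glu": 129.1, "D-Pal": 148.2,
    "D-Cha": 153.2, "D-Tic": 159.2, "D-pF-Phe": 165.2, "D-Trp": 186.2,
}

def min_mass_gap(residues):
    """Smallest difference between any two residue masses in the set."""
    masses = sorted(RESIDUE_MW[name] for name in residues)
    return min(heavier - lighter for lighter, heavier in zip(masses, masses[1:]))

def read_ladder(cumulative_masses, candidates, tolerance=0.3):
    """Assign one residue to each step of an idealized fragment ladder."""
    sequence = []
    for lighter, heavier in zip(cumulative_masses, cumulative_masses[1:]):
        step = heavier - lighter
        hits = [n for n in candidates if abs(RESIDUE_MW[n] - step) <= tolerance]
        if len(hits) != 1:
            raise ValueError(f"ambiguous or unmatched mass step: {step:.1f}")
        sequence.append(hits[0])
    return sequence

print(round(min_mass_gap(RESIDUE_MW), 1))
print(read_ladder([0.0, 128.2, 275.4, 461.6], RESIDUE_MW))
```

Because an L-amino acid and its D-enantiomer have the same mass, the approach only works when each residue appears in a single stereochemical form at a given position, as is the case in Table 1.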

The library was synthesized using equimolar mixtures of each natural or
unnatural amino acid at each synthetic step using standard Fmoc chemistry. As a
representative example, for Library II (P-P-x-x-x-x-x; SEQ ID NO: 11), the strategy
involved use of 1.7 mmoles of a PEG-PA resin (4.5 g at 0.38 mmol/g), C-terminally
modified with an amide, at the start of synthesis. Synthesis using the natural and
unnatural amino acids indicated in Table 1 is carried out through five cycles (i.e., amino
acid positions 7-3 as indicated in Table 1). At this point, 0.1 mmole of the resin is
removed (1/17 of total) and the synthesis is completed with Pro for position 2 (AA2 as
indicated in Table 1), Pro for position 1 (AA1 as indicated in Table 1) and an N-terminal
acetylation to prepare a test library of approximately 2 x 10^5 complexity. For coupling,
depending on the diversity of each step, individual amino acids are weighed to provide a
total of 3.4 mmoles per cycle. The combined amino acids are dissolved in 20 ml of
NMP so that 10 ml (one equivalent) is used for the first coupling and 10 ml for the second
coupling. The first coupling uses HBTU/HOBT chemistry and the second coupling uses
a DCC/HOAt coupling strategy. Limiting amounts of Fmoc amino acids are used at each
cycle to achieve uniform distribution of each natural or unnatural amino acid. Coupling
efficiency is checked with ninhydrin reagent after each cycle of synthesis to assure that the
reactions went to completion. Following synthesis, the library is cleaved from the resin,
precipitated and lyophilized. To remove scavengers, a chromatography step is
performed following lyophilization. The final library is evaluated by HPLC to assure
the complexity of the library.
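
The quantities in the coupling scheme are related by simple arithmetic, which the sketch below makes explicit. It uses only the figures stated above (4.5 g of resin carrying 1.7 mmoles, 3.4 mmoles of amino acids per cycle, 20 ml of NMP split over two couplings). The number of amino acids in the mixture (`n_amino_acids=10`) is a hypothetical value, since the diversity differs from position to position, and the function name is illustrative.

```python
def coupling_plan(resin_g, resin_mmol, total_aa_mmol, n_amino_acids, nmp_ml=20.0):
    """Summarize one synthesis cycle with an equimolar amino acid mixture.

    The mixture is dissolved once and split evenly between two couplings.
    """
    mmol_per_coupling = total_aa_mmol / 2
    return {
        # Resin substitution implied by the stated mass and scale.
        "loading_mmol_per_g": resin_mmol / resin_g,
        # Amino acid mixture relative to resin sites, per coupling.
        "equivalents_per_coupling": mmol_per_coupling / resin_mmol,
        # Equimolar share of each amino acid over the whole cycle.
        "mmol_per_amino_acid": total_aa_mmol / n_amino_acids,
        # Volume of the NMP solution used in each coupling.
        "ml_per_coupling": nmp_ml / 2,
    }


# n_amino_acids=10 is an assumed diversity for one position, for illustration only.
plan = coupling_plan(resin_g=4.5, resin_mmol=1.7, total_aa_mmol=3.4, n_amino_acids=10)
for key, value in plan.items():
    print(f"{key}: {value:.2f}")
```

With these inputs each coupling delivers about one equivalent of the combined amino acids, which is consistent with the statement that limiting amounts are used at each cycle.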

Library Characterization 

Each complete library, containing a mixture of approximately 1600 non-peptide 
compounds (Library I) or 160,000 non-peptide compounds (Library II), was analyzed by
MS. The mass spectrum of each library was consistent with the distribution of 
molecular weights predicted from the diverse amino acids used for the synthesis. 

The activities of the libraries were tested using an FGF binding assay. In
comparison to the peptide PPI-432 (P-P-G-H-F-K-D; SEQ ID NO: 7), the unselected
Library I was approximately 10-fold less potent in blocking the binding of FGF to the
target protein. Similarly, the unselected Library II was about 1000-fold less potent than
PPI-432. These data are consistent with the libraries containing a mixture of diverse
non-peptide compounds whose average potency is decreased relative to the original
peptide upon which the library was designed. Since Libraries I and II are highly diverse,
the active non-peptide compounds should have a spectrum of potencies which
cumulatively yield the average potency seen in the library. Selective enrichment of
non-peptide compounds that interact with the target should be proportional to their binding
affinities, allowing recovery of the most active components in the library.

Selection of Non-Peptide Compounds for Functional Analysis

The target FGF binding protein (α-FGF mAb) was biotinylated using activated
N-hydroxy-succinimide ester. The non-peptide compound library (500 µg) was
incubated with the biotinylated target antibody (17.5 µg) in 500 µl phosphate buffered
saline (PBS) at 4°C for 2 hours. Bound non-peptide compounds were recovered in
complex with the antibody on magnetic streptavidin beads as follows.

Magnetic streptavidin beads (100 µl beads per reaction) were immobilized on a
magnet and washed twice with PBS. The beads were then resuspended in 100 µl PBS
per sample. Washed beads (100 µl) were added to each antibody-library binding
reaction and the mixture was incubated for 15 minutes at 4°C with constant rotation.
The tubes were spun briefly to clear beads off the wall of the tube and the beads were
then washed three times with PBS. The beads were then pelleted by centrifugation or by
use of a magnet and the PBS removed. To elute bound non-peptide compounds, 100 µl
of 10% acetic acid was added and the mixture was incubated at 4°C for 15 minutes.
The beads were immobilized on the magnet and the supernatant, containing the eluted
non-peptide compounds, was recovered.

The eluted non-peptide compounds were brought to dryness under vacuum.
Recovered non-peptide compounds were dissolved in 50 µl water and tested for their
ability to block the interaction of bFGF with the target protein. (Assuming 100%
recovery of non-peptide compounds and 100% binding to bivalent antibody, the amount
of recovered non-peptide compounds was calculated to be 200 ng.) Binding experiments
demonstrated that relative to the unselected library, the potency of the selected
population of non-peptide compounds was increased by approximately 1000-fold.
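
The 200 ng estimate can be reproduced with a short calculation. The sketch below is an illustration under stated assumptions rather than the calculation actually performed: it assumes an antibody molecular weight of about 150,000 Da (typical for an IgG) and an average compound molecular weight of about 860 Da, neither of which is given in the text. The 17.5 µg of antibody and the bivalent, fully occupied binding are taken from the description above.

```python
def max_recovered_ng(antibody_ug, antibody_mw_da=150_000.0,
                     sites_per_antibody=2, compound_mw_da=860.0):
    """Upper bound on recovered compound mass (ng), assuming every site is occupied.

    antibody_mw_da and compound_mw_da are assumed values, not figures from the text.
    """
    antibody_nmol = antibody_ug * 1000.0 / antibody_mw_da  # ng / (g/mol) = nmol
    site_nmol = antibody_nmol * sites_per_antibody          # bivalent antibody
    return site_nmol * compound_mw_da                       # nmol * g/mol = ng


print(f"{max_recovered_ng(17.5):.0f} ng")
```

Under these assumptions the result is close to the 200 ng quoted above; a different assumed compound mass shifts the estimate proportionally.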

Selection of Non-Peptide Compounds for MS and Sequence Analysis

Selection of peptide for liquid chromatography/mass spectrometry (LC/MS)
analysis and sequence analysis was performed as described above, except 50 µg of
antibody and 1 mg of library were used. Selected non-peptide compounds were eluted
with acetic acid and dried by lyophilization.

The selected non-peptide compounds were first analyzed by coupled LC/MS.
The results showed the presence of non-peptide compounds with various
chromatographic elution times. Individual fractions across the chromatographic gradient
were next analyzed by MS. Different fractions contained a relatively small number of
non-peptide compounds with distinct molecular masses. For example, chromatographic
fraction 20.5-21.5 contained a major M/Z ion of 485.7. This ion
corresponds to a non-peptide compound with a molecular mass of approximately 971.
This peak was fragmented and analyzed by tandem MS in order to deduce the structure
of the non-peptide compound. The fragmentation pattern of this compound together with
the known natural and non-natural amino acids used in the library synthesis allowed for
the unambiguous determination of the structure of the non-peptide compound, which
structure is as follows: Pro-Pro-Gly-His-Nal-Lys-Nal (SEQ ID NO: 12). In a similar
manner, the structures of several other major MS peaks were determined and are
summarized below in Table 2:



Table 2: Structure of Compounds that Bind to Anti-FGF Monoclonal Antibody

             1    2    3    4    5    6    7          SEQ ID NO:
(PPI-432)    Pro  Pro  Gly  His  Phe  Lys  Asp        7
(PPI-652)    Pro  Pro  Gly  His  Nal  Lys  Nal        12
(PPI-654)    Pro  Pro  Gly  His  Nal  Lys  D-pF-Phe   13
             Pro  Pro  Gly  His  Phe  Lys  Nal        14
             Pro  Pro  Gly  His  Nal  Lys  Abu        15
             Pro  Pro  Gly  His  Nal  Lys  D-Ala      16
             Pro  Pro  Pal  X    X    X    X          17
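
The relationship between the M/Z value of 485.7 and the molecular mass of approximately 971 reported for the first selected compound can be illustrated with a short calculation. The sketch below assumes the ion is doubly charged, which is an inference from the fact that 2 x 485.7 is about 971; the charge state is not legible in the text. The proton-corrected form is the usual convention for protonated ions and is shown for comparison; the function name is illustrative.

```python
PROTON_MASS = 1.00728  # Da, mass of a proton


def neutral_mass(mz, charge):
    """Neutral mass implied by an m/z value for an ion carrying `charge` protons."""
    return charge * (mz - PROTON_MASS)


mz_observed = 485.7
assumed_charge = 2  # assumption: doubly charged ion

print(f"charge x m/z:     {assumed_charge * mz_observed:.1f}")
print(f"proton-corrected: {neutral_mass(mz_observed, assumed_charge):.1f}")
```

Both figures lie within about two mass units of the value quoted in the text, which is adequate for matching a peak to one library member when the building blocks at each position have well-separated masses.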



Activity of Non-Peptide Compounds Selected from the Secondary Combinatorial
Chemical Library

Several of the non-peptide compounds selected from the combinatorial library
were made synthetically and compared to the starting peptide (PPI-432) for their ability
to bind to the target protein. The compounds were synthesized with an amino-terminal
acetyl group and a carboxy-terminal amide. The potency of two non-peptide compounds
(PPI-652 and PPI-654, shown in Table 2), containing modified amino acids at positions
5 and 7, was compared to that of the starting peptide PPI-432 in their ability to inhibit
radiolabeled FGF binding to biotinylated anti-FGF antibody. The results of this
experiment are shown in Figure 1, which demonstrates that PPI-652 and PPI-654 exhibit
10- to 100-fold higher binding affinity for FGF than the starting peptide PPI-432. Thus, the
recovery of non-peptide compounds from diverse combinatorial libraries following
selection against target correlates with the potency of those non-peptide compounds in
target binding.

EQUIVALENTS 

Those skilled in the art will recognize, or be able to ascertain using no more than 
routine experimentation, many equivalents to the specific embodiments of the invention 
described herein. Such equivalents are intended to be encompassed by the following 
claims. 
SEQUENCE LISTING

(1) GENERAL INFORMATION:

    (i) APPLICANT:
        (A) NAME: PRAECIS PHARMACEUTICALS INCORPORATED
        (B) STREET: ONE HAMPSHIRE STREET
        (C) CITY: CAMBRIDGE
        (D) STATE: MASSACHUSETTS
        (E) COUNTRY: USA
        (F) POSTAL CODE (ZIP): 02139-1572

    (ii) TITLE OF INVENTION: Methods for Identifying Compounds
         that Bind to a Target

    (iii) NUMBER OF SEQUENCES: 17

    (iv) CORRESPONDENCE ADDRESS:
        (A) ADDRESSEE: LAHIVE & COCKFIELD
        (B) STREET: 60 State Street, suite 510
        (C) CITY: Boston
        (D) STATE: Massachusetts
        (E) COUNTRY: USA
        (F) ZIP: 02109-1875

    (v) COMPUTER READABLE FORM:
        (A) MEDIUM TYPE: Floppy disk
        (B) COMPUTER: IBM PC compatible
        (C) OPERATING SYSTEM: PC-DOS/MS-DOS
        (D) SOFTWARE: PatentIn Release #1.0, Version #1.25

    (vi) CURRENT APPLICATION DATA:
        (A) APPLICATION NUMBER: US
        (B) FILING DATE:
        (C) CLASSIFICATION:

    (vii) PRIOR APPLICATION DATA:
        (A) APPLICATION NUMBER: US 08/573,786
        (B) FILING DATE: 18-DEC-1995

    (viii) ATTORNEY/AGENT INFORMATION:
        (A) NAME: DeConti, Giulio A., Jr.
        (B) REGISTRATION NUMBER: 31,503
        (C) REFERENCE/DOCKET NUMBER: PPI-012CPPC

    (ix) TELECOMMUNICATION INFORMATION:
        (A) TELEPHONE: (617)227-7400
        (B) TELEFAX: (617)227-5941



(2) INFORMATION FOR SEQ ID NO:1:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 16 amino acids
        (B) TYPE: amino acid
        (D) TOPOLOGY: linear

    (ii) MOLECULE TYPE: peptide

    (v) FRAGMENT TYPE: internal

    (ix) FEATURE:
        (A) NAME/KEY: Modified-site
        (B) LOCATION: 2-5,7-10,12-15
        (D) OTHER INFORMATION: /note= Xaa is Ala or Gly

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

    Ser Xaa Xaa Xaa Xaa Arg Xaa Xaa Xaa Xaa Leu Xaa Xaa Xaa Xaa Met
    1               5                   10                  15

(2) INFORMATION FOR SEQ ID NO:2:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 16 amino acids
        (B) TYPE: amino acid
        (D) TOPOLOGY: linear

    (ii) MOLECULE TYPE: peptide

    (v) FRAGMENT TYPE: internal

    (ix) FEATURE:
        (A) NAME/KEY: Modified-site
        (B) LOCATION: 2-5,7-10,12-15
        (D) OTHER INFORMATION: /note= Xaa is Ala or Gly

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:

    Ser Xaa Xaa Xaa Xaa Lys Xaa Xaa Xaa Xaa Leu Xaa Xaa Xaa Xaa Gln
    1               5                   10                  15



(2) INFORMATION FOR SEQ ID NO:3:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 16 amino acids
        (B) TYPE: amino acid
        (D) TOPOLOGY: linear

    (ii) MOLECULE TYPE: peptide

    (v) FRAGMENT TYPE: internal

    (ix) FEATURE:
        (A) NAME/KEY: Modified-site
        (B) LOCATION: 2-5,7-10,12-15
        (D) OTHER INFORMATION: /note= Xaa is Ala or Gly

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:

    Phe Xaa Xaa Xaa Xaa Arg Xaa Xaa Xaa Xaa Leu Xaa Xaa Xaa Xaa Thr
    1               5                   10                  15



(2) INFORMATION FOR SEQ ID NO:4:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 16 amino acids
        (B) TYPE: amino acid
        (D) TOPOLOGY: linear

    (ii) MOLECULE TYPE: peptide

    (v) FRAGMENT TYPE: internal

    (ix) FEATURE:
        (A) NAME/KEY: Modified-site
        (B) LOCATION: 2-5,7-10,12-15
        (D) OTHER INFORMATION: /note= Xaa is Ala or Gly

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:

    Ser Xaa Xaa Xaa Xaa Asn Xaa Xaa Xaa Xaa Leu Xaa Xaa Xaa Xaa Ile
    1               5                   10                  15

(2) INFORMATION FOR SEQ ID NO:5:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 16 amino acids
        (B) TYPE: amino acid
        (D) TOPOLOGY: linear

    (ii) MOLECULE TYPE: peptide

    (v) FRAGMENT TYPE: internal

    (ix) FEATURE:
        (A) NAME/KEY: Modified-site
        (B) LOCATION: 2-5,7-10,12-15
        (D) OTHER INFORMATION: /note= Xaa is Ala or Gly

    (ix) FEATURE:
        (A) NAME/KEY: Modified-site
        (B) LOCATION: 16
        (D) OTHER INFORMATION: /note= Xaa is any amino acid

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:

    Ser Xaa Xaa Xaa Xaa Arg Xaa Xaa Xaa Xaa Leu Xaa Xaa Xaa Xaa Xaa
    1               5                   10                  15



(2) INFORMATION FOR SEQ ID NO:6:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 7 amino acids
        (B) TYPE: amino acid
        (D) TOPOLOGY: linear

    (ii) MOLECULE TYPE: peptide

    (v) FRAGMENT TYPE: internal

    (ix) FEATURE:
        (A) NAME/KEY: Modified-site
        (B) LOCATION: 2, 5, 7
        (D) OTHER INFORMATION: /note= Xaa is any amino acid

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:

    Pro Xaa Gly His Xaa Lys Xaa
    1               5



(2) INFORMATION FOR SEQ ID NO:7:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 7 amino acids
        (B) TYPE: amino acid
        (D) TOPOLOGY: linear

    (ii) MOLECULE TYPE: peptide

    (v) FRAGMENT TYPE: internal

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:

    Pro Pro Gly His Phe Lys Asp
    1               5



(2) INFORMATION FOR SEQ ID NO:8:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 14 amino acids
        (B) TYPE: amino acid
        (D) TOPOLOGY: linear

    (ii) MOLECULE TYPE: peptide

    (v) FRAGMENT TYPE: internal

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:

    Gly Ala Phe Pro Pro Gly His Phe Lys Asp Pro Asp Arg Leu
    1               5                   10



(2) INFORMATION FOR SEQ ID NO:9:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 7 amino acids
        (B) TYPE: amino acid
        (D) TOPOLOGY: linear

    (ii) MOLECULE TYPE: peptide

    (v) FRAGMENT TYPE: internal

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:9:

    Pro Arg Gly His Trp Lys Gln
    1               5



(2) INFORMATION FOR SEQ ID NO:10:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 7 amino acids
        (B) TYPE: amino acid
        (D) TOPOLOGY: linear

    (ii) MOLECULE TYPE: peptide

    (v) FRAGMENT TYPE: internal

    (ix) FEATURE:
        (A) NAME/KEY: Modified-site
        (B) LOCATION: 5-7
        (D) OTHER INFORMATION: /note= Xaa is any amino acid

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:10:

    Pro Pro Gly His Xaa Xaa Xaa
    1               5



(2) INFORMATION FOR SEQ ID NO:11:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 7 amino acids
        (B) TYPE: amino acid
        (D) TOPOLOGY: linear

    (ii) MOLECULE TYPE: peptide

    (v) FRAGMENT TYPE: internal

    (ix) FEATURE:
        (A) NAME/KEY: Modified-site
        (B) LOCATION: 3-7
        (D) OTHER INFORMATION: /note= Xaa is any amino acid

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:11:

    Pro Pro Xaa Xaa Xaa Xaa Xaa
    1               5
(2) INFORMATION FOR SEQ ID NO:12:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 7 amino acids
        (B) TYPE: amino acid
        (D) TOPOLOGY: linear

    (ii) MOLECULE TYPE: peptide

    (v) FRAGMENT TYPE: internal

    (ix) FEATURE:
        (A) NAME/KEY: Modified-site
        (B) LOCATION: 5, 7
        (D) OTHER INFORMATION: /note= Xaa is Nal

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:12:

    Pro Pro Gly His Xaa Lys Xaa
    1               5



(2) INFORMATION FOR SEQ ID NO:13:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 7 amino acids
        (B) TYPE: amino acid
        (D) TOPOLOGY: linear

    (ii) MOLECULE TYPE: peptide

    (v) FRAGMENT TYPE: internal

    (ix) FEATURE:
        (A) NAME/KEY: Modified-site
        (B) LOCATION: 5
        (D) OTHER INFORMATION: /note= Xaa is Nal

    (ix) FEATURE:
        (A) NAME/KEY: Modified-site
        (B) LOCATION: 7
        (D) OTHER INFORMATION: /note= Xaa is D-pF-Phe

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:13:

    Pro Pro Gly His Xaa Lys Xaa
    1               5



(2) INFORMATION FOR SEQ ID NO:14:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 7 amino acids
        (B) TYPE: amino acid
        (D) TOPOLOGY: linear

    (ii) MOLECULE TYPE: peptide

    (v) FRAGMENT TYPE: internal

    (ix) FEATURE:
        (A) NAME/KEY: Modified-site
        (B) LOCATION: 7
        (D) OTHER INFORMATION: /note= Xaa is Nal

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:14:

    Pro Pro Gly His Phe Lys Xaa
    1               5



(2) INFORMATION FOR SEQ ID NO:15:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 7 amino acids
        (B) TYPE: amino acid
        (D) TOPOLOGY: linear

    (ii) MOLECULE TYPE: peptide

    (v) FRAGMENT TYPE: internal

    (ix) FEATURE:
        (A) NAME/KEY: Modified-site
        (B) LOCATION: 5
        (D) OTHER INFORMATION: /note= Xaa is Nal

    (ix) FEATURE:
        (A) NAME/KEY: Modified-site
        (B) LOCATION: 7
        (D) OTHER INFORMATION: /note= Xaa is Abu

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:15:

    Pro Pro Gly His Xaa Lys Xaa
    1               5



(2) INFORMATION FOR SEQ ID NO:16:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 7 amino acids
        (B) TYPE: amino acid
        (D) TOPOLOGY: linear

    (ii) MOLECULE TYPE: peptide

    (v) FRAGMENT TYPE: internal

    (ix) FEATURE:
        (A) NAME/KEY: Modified-site
        (B) LOCATION: 5
        (D) OTHER INFORMATION: /note= Xaa is Nal

    (ix) FEATURE:
        (A) NAME/KEY: Modified-site
        (B) LOCATION: 7
        (D) OTHER INFORMATION: /note= Xaa is D-Ala

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:16:

    Pro Pro Gly His Xaa Lys Xaa
    1               5



(2) INFORMATION FOR SEQ ID NO:17:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 7 amino acids
        (B) TYPE: amino acid
        (D) TOPOLOGY: linear

    (ii) MOLECULE TYPE: peptide

    (v) FRAGMENT TYPE: internal

    (ix) FEATURE:
        (A) NAME/KEY: Modified-site
        (B) LOCATION: 3
        (D) OTHER INFORMATION: /note= Xaa is Pal

    (ix) FEATURE:
        (A) NAME/KEY: Modified-site
        (B) LOCATION: 4-7
        (D) OTHER INFORMATION: /note= Xaa is any amino acid

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:17:

    Pro Pro Pal Xaa Xaa Xaa Xaa
    1               5
CLAIMS 

1. A method for identifying a compound that binds to a target, the method
comprising:

a) forming a first library comprising a multiplicity of peptides;

b) selecting from the first library at least one peptide that binds to the target;

c) determining the sequence or sequences of the at least one peptide that binds to
the target, thereby generating a peptide motif;

d) forming a second library comprising a multiplicity of non-peptide compounds
designed based on the peptide motif;

e) selecting from the second library at least one non-peptide compound that binds
to the target; and

f) determining the structure or structures of the at least one non-peptide
compound that binds to the target;

thereby identifying a compound that binds to the target.

2. The method of claim 1, wherein the first library is a phage display library.

3. The method of claim 1, wherein the first library is bound to a solid support.

4. The method of claim 1, wherein the first library is an anchor library.

5. The method of claim 1, wherein the first library comprises at least about 10^
peptides.

6. The method of claim 1, wherein the first library comprises at least about 10^
peptides.

7. The method of claim 1, wherein the first library comprises at least about 10^2
peptides.

8. The method of claim 1, wherein step c) comprises determining the nucleotide
sequence of a nucleic acid molecule or molecules that encode the at least one peptide.

9. The method of claim 1, wherein step c) comprises determining the amino acid
sequence or sequences of the at least one peptide.
10. The method of claim 1, wherein the second library comprises at least one peptide
derivative.

11. The method of claim 1, wherein the second library comprises at least one peptide
analogue.

12. The method of claim 1, wherein the second library comprises at least one
peptidomimetic.

13. The method of claim 1, wherein the second library comprises at least about 10^2
non-peptide compounds.

14. The method of claim 1, wherein the second library comprises at least about 10^
non-peptide compounds.

15. The method of claim 1, wherein the second library comprises at least about 10^6
non-peptide compounds.

16. The method of claim 1, wherein step f) comprises analyzing the at least one
non-peptide compound by a mass spectrometric method.

17. The method of claim 16, wherein the mass spectrometric method comprises
tandem mass spectrometry.

18. The method of claim 1, wherein the compound that binds to a target has a
binding affinity for the target, expressed as an apparent Kd, EC50 or IC50, of at least
about 10^-7 M.

19. The method of claim 1, wherein the compound that binds to a target has a
binding affinity for the target, expressed as an apparent Kd, EC50 or IC50, of at least
about 10^-8 M.

20. The method of claim 1, wherein the compound that binds to a target has a
binding affinity for the target, expressed as an apparent Kd, EC50 or IC50, of at least
about 10^-9 M.
21. The method of claim 1, wherein the at least one non-peptide compound that
binds to the target as selected in step e) has at least a 10-fold higher affinity for the target
than the at least one peptide that binds the target as selected in step b).

22. The method of claim 1, wherein the at least one non-peptide compound that
binds to the target as selected in step e) has at least a 100-fold higher affinity for the
target than the at least one peptide that binds the target as selected in step b).

23. The method of claim 1, wherein the at least one non-peptide compound that
binds to the target as selected in step e) has at least a 1000-fold higher affinity for the
target than the at least one peptide that binds the target as selected in step b).

24. The method of claim 1, further comprising:

g) forming a third library comprising a multiplicity of non-peptide compounds
designed based on the structure or structures of the non-peptide compound or
compounds determined in step f);

h) selecting from the third library at least one non-peptide compound that binds
to the target; and

i) determining the structure or structures of the at least one non-peptide
compound selected in step h);

thereby identifying a compound that binds to the target.

25. A method for identifying a compound that binds to a target, the method
comprising:

a) forming a first library comprising a multiplicity of peptides displayed on the
surface of a bacteriophage;

b) selecting from the first library at least one peptide that binds to the target;

c) determining the sequence or sequences of the at least one peptide that binds to
the target, thereby generating a peptide motif;

d) forming a second library comprising a multiplicity of non-peptide compounds
designed based on the peptide motif;

e) selecting from the second library at least one non-peptide compound that binds
to the target; and

f) determining the structure or structures of the at least one non-peptide
compound that binds to the target by tandem mass spectrometry;

thereby identifying a compound that binds to the target.
26. A method for identifying a compound that binds to a target, the method
comprising:

a) forming a first library comprising an anchor library of a multiplicity of
peptides;

b) selecting from the first library at least one peptide that binds to the target;

c) determining the sequence or sequences of the at least one peptide that binds to
the target, thereby generating a peptide motif;

d) forming a second library comprising a multiplicity of non-peptide compounds
designed based on the peptide motif;

e) selecting from the second library at least one non-peptide compound that binds
to the target; and

f) determining the structure or structures of the at least one non-peptide
compound that binds to the target by tandem mass spectrometry;

thereby identifying a compound that binds to the target.

27. A compound identified by the method of claim 1.

28. The compound of claim 27, which is a peptidomimetic.

29. The compound of claim 27, which binds to the target with a binding affinity,
expressed as an apparent Kd, EC50 or IC50, of at least about 10^-7 M.

30. The compound of claim 27, which binds to the target with a binding affinity,
expressed as an apparent Kd, EC50 or IC50, of at least about 10^-8 M.

31. The compound of claim 27, which binds to the target with a binding affinity,
expressed as an apparent Kd, EC50 or IC50, of at least about 10^-9 M.

32. The compound of claim 27, which binds to the target with at least a 10-fold
higher affinity than the at least one peptide that binds the target as selected in step b).

33. The compound of claim 27, which binds to the target with at least a 100-fold
higher affinity than the at least one peptide that binds the target as selected in step b).

34. The compound of claim 27, which binds to the target with at least a 1000-fold
higher affinity than the at least one peptide that binds the target as selected in step b).
35. A library comprising a multiplicity of non-peptide compounds designed based on
a peptide motif, wherein the peptide motif is determined by selecting from a peptide
library at least one peptide that binds to a target, determining the sequence or sequences
of the at least one peptide that binds to the target and determining a peptide motif.

36. The library of claim 35, wherein the library comprises at least one
peptidomimetic.

37. The library of claim 35, wherein the library comprises at least about 10^2
non-peptide compounds.

38. The library of claim 35, wherein the library comprises at least about 10* non-
peptide compounds.

39. The library of claim 35, wherein the library comprises at least about 10^6
non-peptide compounds.

40. The library of claim 35, wherein the multiplicity of non-peptide compounds are
attached to a solid support.